A Study of Accessible Motifs and RNA Folding Complexity
نویسندگان
چکیده
mRNA molecules are folded in the cells and therefore many of their substrings may actually be inaccessible to protein and microRNA binding. The need to apply an accessibility criterion to the task of genome-wide mRNA motif discovery raises the challenge of overcoming the core O(n(3)) factor imposed by the time complexity of the currently best known algorithms for RNA secondary structure prediction. We speed up the dynamic programming algorithms that are standard for RNA folding prediction. Our new approach significantly reduces the computations without sacrificing the optimality of the results, yielding an expected time complexity of O(n(2) psi(n)), where psi(n) is shown to be constant on average under standard polymer folding models. A benchmark analysis confirms that in practice the runtime ratio between the previous approach and the new algorithm indeed grows linearly with increasing sequence size. The fast new RNA folding algorithm is utilized for genome-wide discovery of accessible cis-regulatory motifs in data sets of ribosomal densities and decay rates of S. cerevisiae genes and to the mining of exposed binding sites of tissue-specific microRNAs in A. thaliana.
منابع مشابه
RNA secondary structure prediction and runtime optimization
1. Background RNA secondary structure Pseudoknots Non-coding RNA 2. CONTRAfold: Probabilistic RNA folding Overview of the algorithm Details of the algorithm Performance of CONTRAfold 3. Other RNA folding methods: Physics-based models and Stochastic Context Free Grammars Physics-based models Stochastic Context Free Grammars Advantages of CONTRAfold over these other approaches 4. How RNA folding ...
متن کاملThe fastest global events in RNA folding: electrostatic relaxation and tertiary collapse of the Tetrahymena ribozyme.
Large RNAs can collapse into compact conformations well before the stable formation of the tertiary contacts that define their final folds. This study identifies likely physical mechanisms driving these early compaction events in RNA folding. We have employed time-resolved small-angle X-ray scattering to monitor the fastest global shape changes of the Tetrahymena ribozyme under different ionic ...
متن کاملMeRNA: a database of metal ion binding sites in RNA structures
Metal ions are essential for the folding of RNA into stable tertiary structures and for the catalytic activity of some RNA enzymes. To aid in the study of the roles of metal ions in RNA structural biology, we have created MeRNA (Metals in RNA), a comprehensive compilation of all metal binding sites identified in RNA 3D structures available from the PDB and Nucleic Acid Database. Currently, our ...
متن کاملDo conformational biases of simple helical junctions influence RNA folding stability and specificity?
Structured RNAs must fold into their native structures and discriminate against a large number of alternative ones, an especially difficult task given the limited information content of RNA's nucleotide alphabet. The simplest motifs within structured RNAs are two helices joined by nonhelical junctions. To uncover the fundamental behavior of these motifs and to elucidate the underlying physical ...
متن کاملRelation Between RNA Sequences, Structures, and Shapes via Variation Networks
Background: RNA plays key role in many aspects of biological processes and its tertiary structure is critical for its biological function. RNA secondary structure represents various significant portions of RNA tertiary structure. Since the biological function of RNA is concluded indirectly from its primary structure, it would be important to analyze the relations between the RNA sequences and t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Journal of computational biology : a journal of computational molecular cell biology
دوره 14 6 شماره
صفحات -
تاریخ انتشار 2006